We demonstrate a first-order phase transition from non-life to life, defined as non-replicating and
replicating systems respectively, and characterize some of its dynamical properties. The model differs
from those described previously in that we explicitly couple replicators to their environment
through the recycling of a finite supply of resources. We find that the environment plays a central
role in defining the dynamics of the transition. The phase transition corresponds to a redistribution
of matter in replicators and their environment, driven by selection on replicators. When system
resources are not successfully repartitioned, the transition fails to complete, leading to
the possibility of many frustrated trials before life first emerges. The mutual information shared
between replicators and environment accurately tracks the progress of the phase transition. The phase
transition is marked by a sequence of abrupt transitions whereby replicators become increasingly
distinct from their environment. Often, the replicators that nucleate the transition in the non-life
phase are not those that are ultimately selected in the life phase. During the phase transition the
system experiences an explosive growth in diversity. We discuss the implications of these results for
understanding life’s emergence and evolutionary transitions more broadly.
INTRODUCTION

Life is a state of matter characterized by a stable pattern
of non-equilibrium behavior. Replication, central to the
maintenance of pattern, is thought to play an important
role in driving the transition from non-living to living
matter [?]. Numerous theoretical studies have therefore
focused on the emergence of the first replicators,
including identifying the conditions under which replicators
can be selected. Of note is the transition from
“pre-life” to “life” observed by Nowak and collaborators,
where pre-life is defined as a generative chemistry with
no replication, to be contrasted with life, which replicates
and evolves [?]. By tuning model parameters, a transition
is observed where, above a critical replication rate,
replicators are selected [3]. Similar features have been
noted by Wu et al. [?] and others [8]. While several
of these studies have discussed the possibility of a phase
transition associated with this behavior, the underlying
properties of the transition remain to be fully explicated.


In the present study, we demonstrate a spontaneous,
first-order phase transition from non-life to life, defined as
non-replicating and replicating systems respectively, and
establish a central role for the environment in defining the
properties of this transition. A prominent feature of prebiotic
chemistry, as evidenced by more than sixty years of
experiments in prebiotic synthesis, is that it is difficult to
abiotically synthesize biomolecules with high yield under
early Earth conditions [5]. Accordingly, we regard resource
limitation as a common, and likely important,
feature of prebiotic environments. Indeed, previous studies
have indicated that finite resources could have been
an important factor in driving the emergence of life [?].
We therefore consider a model prebiotic chemistry
with a finite supply of monomers, which must be recycled
through polymer degradation to replenish resources
available for synthesis of new polymers. We are motivated
by empirical evidence that recycling, mediated by
coupled polymerization and degradation reactions, may
have been an important mechanism driving the early
evolution of biopolymers [?]. We note that the majority
of theoretical models for the emergence of replicators, by
contrast, implement reactor flows with a constant flux
of monomers into the system and removal of chemical
species via dilution (see e.g. [?]). The explicit incorporation
of non-linear feedback between environment
and replicators via degradative recycling presented here
therefore distinguishes our computational model from
most studied previously.

Due to environmental feedback, the phase transition
reported here exhibits features not observable in other
models for the emergence of the first replicators. Most
notably, the phase transition corresponds to a redistribution
of matter in replicators and the environment, driven
by selection on replicators. The mutual information
shared between replicators and environment accurately
tracks the progress of the phase transition. The transition
is more likely to occur when replicators
and environment share relatively high mutual information.
Selection on the fitness of replicators, including
their replicative efficiency and stability as studied here,
occurs only in the life phase. The first replicator(s) that
appear typically match the bulk composition of their
environment. Since the composition of the environment
does not in general match that of “fit” replicators (the
environment is not fine-tuned for life), the replicators that
nucleate the transition in the non-life phase are often
not those that are ultimately selected in the life phase.
Due to resource constraints, the phase transition proceeds
through a sequence of abrupt transitions whereby the
composition of replicators becomes increasingly distinct
from that of their environment. During the phase transition
the system experiences an explosive growth in diversity,
and concomitantly a massive extinction of extant
chemical species to accommodate restructuring driven by
the selection on replicators. Extant diversity and the rate
of exploration of novel diversity are both higher after the
transition than before. When system resources are not
successfully repartitioned, the transition fails to complete,
leading to the possibility of many frustrated trials before
life first emerges. We discuss the implications of these
results for understanding the emergence of life and
evolutionary transitions more broadly.

METHODS 

Model Description and Motivation 

We model the emergence of replicators in an artificial
prebiotic chemistry consisting of two monomer types
denoted by ‘0’ and ‘1’. Polymerization occurs via addition
of monomers to the end of growing strings. Sequences
can degrade into shorter sequences; degradation can
occur at any bond within a given sequence with equal
probability. We assume that the inverse process of two
short but non-monomeric sequences ligating to produce
a longer polymer is sufficiently rare to be neglected.
Sequences of length L ≥ r can self-replicate. This minimal
replicator length approximates a minimal complexity for
self-replication in our simplified model. We note that
the properties reported here are general and qualitatively
similar for any r, where r primarily determines the relative
timescale for discovering replicators, and thus for
the phase transition to occur (described below). In this
study, we set r = 7, such that the appearance of the
first replicators is rare, but not so rare that we never
observe it [?]. We expect that changing r will change
the timescale of the transition but will not qualitatively
affect the results presented here.

In the absence of an imposed fitness landscape (defined
below), the properties of our chemistry are fully
specified by the rate constants k_p, k_d and k_r for polymerization,
degradation and replication, respectively. We regard
these parameters, along with the abundances of ‘0’
and ‘1’ monomers, as fully specifying the prebiotic environment
in our model chemistry. Thus, for example, a
difference in temperature defining two different prebiotic
environments would correspond in our system to simulations
conducted with two different k_d values, provided
that temperature affects the degradation rate of polymers
for the particular chemistry of interest (which might,
for example, depend on the backbone chemistry of biopolymers,
see [12] for discussion).

Since we are interested in the dynamics of replication
in this work, and specifically the transition from non-life
to life, we do not include the effects of mutation,
which is well known to play an important role in evolution
once life has already emerged. Therefore a simplification
in our model is that replication only functions to
copy extant sequences, and does not produce any novelty.
Novelty is introduced through the prebiotic processes of
polymerization and degradation. As long as mutations
also obey the principles of resource constraints, we expect
that their primary effect would be to increase the
search rate for replicator(s) that match their environment,
which, as we discuss below, nucleates the phase
transition described here.

Simulations were implemented using a kinetic Monte
Carlo algorithm [?]. For more detailed discussion
of the implementation of that algorithm in prebiotic recycling
chemistries we refer the reader to [12] or [?].
In what follows, the polymerization, degradation, and
replication rate constants were set to k_p = 0.0005, k_d =
0.5000, and k_r = 0.0050 respectively, and the system was
initialized with 500 monomers each of ‘0’ and ‘1’ with no
polymers present, unless otherwise noted. It is important
to note that since this is a closed system, the initial
conditions specify the bulk composition of the system for
all time.
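
As an illustration only, the reaction scheme and rate constants quoted above can be sketched as a single Gillespie-type kinetic Monte Carlo step. This is a minimal sketch and not the implementation used for the results reported here: it omits both fitness landscapes, assumes a per-molecule degradation rate k_d and mass-action polymerization, and every function name (`propensities`, `gillespie_step`, `monomer_mass`, `run`) is hypothetical.

```python
import random
from collections import Counter

K_P, K_D, K_R = 0.0005, 0.5, 0.005  # rate constants quoted in the text
R_MIN = 7                           # minimal replicator length r

def propensities(pop):
    """Enumerate (rate, kind, sequence, monomer) for every possible reaction."""
    events = []
    for seq, n in pop.items():
        if n == 0:
            continue
        for m in "01":
            # polymerization: seq + m -> seq+m (assumed mass-action kinetics)
            partners = pop[m] - (1 if seq == m else 0)
            if partners > 0:
                events.append((K_P * n * partners, "poly", seq, m))
        if len(seq) >= 2:
            # degradation: assumed rate K_D per molecule, bond chosen later
            events.append((K_D * n, "deg", seq, None))
        enough = pop["0"] >= seq.count("0") and pop["1"] >= seq.count("1")
        if len(seq) >= R_MIN and enough:
            # replication copies seq out of free monomers (no fitness terms)
            events.append((K_R * n, "rep", seq, None))
    return events

def gillespie_step(pop, t, rng):
    """Advance the population by one reaction; returns (new_time, kind)."""
    events = propensities(pop)
    total = sum(e[0] for e in events)
    if total == 0:
        return t, None
    t += rng.expovariate(total)       # exponential waiting time
    pick = rng.uniform(0, total)      # choose a reaction proportional to rate
    acc = 0.0
    for rate, kind, seq, m in events:
        acc += rate
        if pick <= acc:
            break
    if kind == "poly":
        pop[seq] -= 1
        pop[m] -= 1
        pop[seq + m] += 1
    elif kind == "deg":
        cut = rng.randrange(1, len(seq))  # every bond equally likely
        pop[seq] -= 1
        pop[seq[:cut]] += 1
        pop[seq[cut:]] += 1
    else:
        pop["0"] -= seq.count("0")
        pop["1"] -= seq.count("1")
        pop[seq] += 1
    return t, kind

def monomer_mass(pop, m):
    """Total number of monomers of type m, free or bound in polymers."""
    return sum(seq.count(m) * n for seq, n in pop.items())

def run(steps, seed=0):
    """Start from 500 free monomers of each type and take `steps` reactions."""
    rng = random.Random(seed)
    pop = Counter({"0": 500, "1": 500})
    t = 0.0
    for _ in range(steps):
        t, kind = gillespie_step(pop, t, rng)
        if kind is None:
            break
    return pop, t
```

Because every reaction only moves monomers between free and bound states, `monomer_mass(pop, "0")` and `monomer_mass(pop, "1")` stay at 500 throughout a run, which is the closed-system property stated above.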
Two Fitness Landscapes: Static and Dynamic


To explicitly couple the properties of replicators to those
of their environment, we model the fitness of replicators
as determined by two factors:

1. a static fitness associated with a trade-off between
stability and replicative efficiency intrinsic to individual
replicators, and

2. a dynamic fitness associated with resource availability
in the environment.


The first introduces a non-dynamic component of the fitness
landscape associated with the properties of individual
replicators that is a common feature of origin of life
models. The latter, environmentally dictated fitness is
dynamic and a unique feature of the resource-dependent
replication model presented here (see also [12] or [15]).

Static Fitness. The trade-off between stability and
replicative efficiency of sequences captures features of nucleic
acid systems believed to play an important role in
early evolution, in particular that molecules that fold
well are typically not good templates and conversely that
good templates often do not fold well and are thus less
resistant to degradation [?]. The mathematical form of
this trade-off implemented here is inspired by that of [?],
in that we encode both the stability and replicative efficiency
of replicators utilizing a sigmoid function:


$$ f(n) = 0.5 + \frac{n^{2}}{2\,(10 + n^{2})} \qquad (1) $$


The replicative fitness of a sequence x_i with length
L is quantified by scaling its replication rate by 1 +
f(zeros(x_i)). Similarly, the stability of sequence x_i
is determined by scaling the degradation rate by 1 −
f(ones(x_i)). Here zeros(x_i) and ones(x_i) count the ‘0’
and ‘1’ monomers in x_i. Thus, for sequences of a given length L ≥ 7,
the homogeneous string of ‘1’s is the most stable sequence
of that length, the homogeneous string of ‘0’s
is the fastest replicator, and sequences with a roughly
equal number of ‘0’s and ‘1’s best balance stability with
replicative efficiency. This landscape is constructed to
reflect two features we regard as important to the generally
unknown shape of biopolymer fitness landscapes: the
fittest sequences are much rarer than less fit sequences
and their composition is not necessarily reflective of the
bulk composition of their environment. Sequences with
L < 7 do not replicate, so only the stability landscape
is relevant for short sequences. This trade-off establishes
a fitness landscape intrinsic to a polymer’s specific sequence
composition that is fixed within a given environmental
context.
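
The scaling rules above can be made concrete with a short sketch. It assumes that the numerator of the sigmoid in Eq. (1) is n², which gives a function rising from 0.5 towards 1; that numerator, and the helper names `replication_scale` and `degradation_scale`, are assumptions made for illustration rather than details confirmed by the text.

```python
def f(n):
    """Sigmoid of Eq. (1); the n**2 numerator is an assumption of this sketch."""
    return 0.5 + n**2 / (2 * (10 + n**2))

def replication_scale(seq):
    """Factor multiplying the replication rate: 1 + f(number of '0's)."""
    return 1 + f(seq.count("0"))

def degradation_scale(seq):
    """Factor multiplying the degradation rate: 1 - f(number of '1's)."""
    return 1 - f(seq.count("1"))

if __name__ == "__main__":
    for seq in ("0000000", "0101010", "1111111"):
        print(seq, replication_scale(seq), degradation_scale(seq))
```

Under this assumed form, the all-‘0’ heptamer has the largest replication factor and the all-‘1’ heptamer the smallest degradation factor, matching the ordering described in the text.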

Dynamic Fitness. Replication rates are also dynamically
determined by the availability of free monomers in
the environment. The replication rate for sequence x_i
is weighted by a factor $\sum_{n=1}^{L-1} y_n y_{n+1}$, where y_n is the
abundance of the monomer species at position n in sequence
x_i. This term yields a resource-dependent replication
rate that is also sequence dependent, similar to
that implemented by one of us (SIW) in [15] where
dynamics analogous to the phase transition reported herein
were observed, although not characterized as such. The
functional form of the resource dependence is an approximation
intended to model nucleation of the first bond
formed on a template as the rate-limiting step, while
simultaneously capturing sequence information by summing
over all possible nucleation events on the template,
as introduced in [?]. Thus, in our model the replication
rate of a given sequence x_i depends in part on how
well its sequence composition matches the relative abundances
of ‘0’ and ‘1’ monomers in the environment. Since
the abundances of ‘0’ and ‘1’ monomers change over time
as monomers are consumed via polymerization and replication
and generated via degradation, this creates an
environmentally dictated fitness landscape that is a central
feature of any resource-constrained dynamics. We expect
qualitative features of the dynamics observed here to be
a general feature of sequence-specific resource-dependent
replication that does not depend specifically on the form
of dynamic landscape chosen for this study and will be
qualitatively similar for other replicator models where
stoichiometry plays a role in setting the efficiency of
replication.
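
A minimal sketch of the resource-dependent weight follows. It assumes the sum runs over the L − 1 adjacent position pairs of the template (one term per possible first bond), and that `free` maps each monomer type to its current free abundance; the function name `resource_weight` is hypothetical.

```python
def resource_weight(seq, free):
    """Sum over adjacent template positions of y_n * y_(n+1).

    seq  : template sequence, e.g. "0101010"
    free : dict of free-monomer abundances, e.g. {"0": 10, "1": 20}
    """
    y = [free[m] for m in seq]                     # abundance needed at each position
    return sum(a * b for a, b in zip(y, y[1:]))    # one term per adjacent pair

if __name__ == "__main__":
    env = {"0": 10, "1": 20}
    for seq in ("0000000", "0101010", "1111111"):
        print(seq, resource_weight(seq, env))
```

Because the weight is recomputed from the current free-monomer counts, the same sequence replicates faster or slower as the environment is depleted or replenished, which is the dynamic component of fitness described above.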


Mutual Information as a Measure of the Transition from Non-life to Life


To characterize the dynamics of the phase transition,
we employ mutual information, a common tool in information
theory, which measures the mutual dependence
of two variables. Most often, mutual information is
calculated as a single value over time series data.
Herein, we instead treat the mutual information between
two variables as a time-series variable itself, in order to
track the progress of the phase transition from non-life to
life. To generate a time series for mutual information we
use the pointwise mutual information, $\mathcal{P}$. Given two random
variables X = {x_1, x_2, ..., x_n} and Y = {y_1, y_2, ..., y_m},
$\mathcal{P}$ is quantified as:

$$ \mathcal{P}(x_i : y_i) = \log \frac{p(x_i, y_i)}{p(x_i)\, p(y_i)} \qquad (2) $$

[?], where p(x_i) and p(y_i) are the probabilities of observing
the event where X is in state x_i and Y is in state
y_i, respectively, and p(x_i, y_i) is the joint probability of
this event occurring. We generated probability distributions
by counting the frequency of a given event (e.g.
abundance of ‘0’ and ‘1’ monomers and of replicators of
a given sequence composition) in our time series data.
In the results presented here, the distributions were generated
using time series data from an ensemble of 100
experimental runs over 10,000 time steps each. To ensure
that the frequency based probability distributions
were not biased by counting states from different phases
of the system (see below), the frequencies were generated
from data that sampled equally from both phases.
The probabilities of different states therefore represent
ensemble statistics which do not depend on time, while
the particular ordering of states in a time series is used to
determine the time series $\mathcal{P}$. In principle, $\mathcal{P}(x_i : y_i)$ can
be calculated at every time step. However, in stochastic
systems, such as ours, $\mathcal{P}$ will fluctuate rapidly in time
and is unlikely to yield useful insights. We therefore sum
$\mathcal{P}$ over a fixed time window to yield the mutual information
for that window. Explicitly, for a window size of w,
the mutual information at time t is defined as the average
of $\mathcal{P}(x_i : y_i)$, and is given by:

$$ I(X(t) : Y(t)) = \sum_{i = t - (w/2)}^{t + (w/2)} p(x_i, y_i) \log \frac{p(x_i, y_i)}{p(x_i)\, p(y_i)} \qquad (3) $$

[22]. This value will depend on time, not because the
probabilities of different states depend on time, but
rather because the realization of different states is time ordered.
Determining an appropriate size for w is important. If
w is too large, the entire measurement collapses into one
value yielding no insights into how the system is evolving
in time. By contrast, if w is too small, fluctuations wash
out interesting larger scale structure.
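
The windowed measure described above can be sketched as follows. This is an illustration under stated assumptions, not the analysis code behind the figures: probabilities are estimated by frequency counts from the single pair of series passed in (the text uses an ensemble of runs sampled equally from both phases), natural logarithms are assumed, and the names `pmi_series` and `windowed_mi` are hypothetical.

```python
import math
from collections import Counter

def pmi_series(xs, ys):
    """Pointwise mutual information of each (x_i, y_i) pair in the series."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return [
        math.log((pxy[(x, y)] / n) / ((px[x] / n) * (py[y] / n)))
        for x, y in zip(xs, ys)
    ]

def windowed_mi(xs, ys, w):
    """Sum p(x_i, y_i) * PMI_i over a window of half-width w // 2 around each t."""
    n = len(xs)
    pxy = Counter(zip(xs, ys))
    pmi = pmi_series(xs, ys)
    half = w // 2
    out = []
    for t in range(half, n - half):
        window = range(t - half, t + half + 1)
        out.append(sum(pxy[(xs[i], ys[i])] / n * pmi[i] for i in window))
    return out

if __name__ == "__main__":
    xs = [0, 1] * 50
    print(windowed_mi(xs, xs, 10)[:3])
```

States can be any hashable objects, so the ordered pairs (number of ‘0’s, number of ‘1’s) used for replicators and environment can be passed in directly as tuples.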

In our system, mutual information was measured between
R and E, which are ordered pairs that track the
number of zeros and number of ones in replicators
(R) and in free monomers in the environment (E).
We chose w heuristically, such that the value of I(R; E),
tracking the mutual information between replicators and
environment, was relatively constant but large fluctuations
could still be resolved. For the results presented
w = 100Δt, where Δt = 0.1 k_d^{-1} is the resolution of
the time series data in natural units. We note that different
values of w change the results quantitatively, but not
qualitatively: the system still maintains a non-zero value
of the mutual information in the non-life phase which
tends toward zero in the life phase (discussed below).


RESULTS 
Non-life and Life 

Two long-lived states are observed in our model: “non-life”
and “life”, which are dominated by polymerization
and replication, respectively. While the non-life phase
here shares features in common with “pre-life” as previously
characterized [3], it also has some striking differences.
We therefore use “non-life” rather than “pre-life”
as it does not imply life will inevitably emerge, since in
our system many transitions fail to complete.


In the non-life phase, long sequences are exponentially
rare, and the majority of system mass is in monomers
and dimers (not shown). Sequences of all lengths have
relatively similar composition, as shown in Fig. 1. The
composition of extant polymers is reflective of the abiotic
availability of resources and the stability landscape
established by Eq. (1). In contrast to other models [?],
here replicators exist in the non-life phase, albeit at
exponentially low abundance. However, selection on replication
cannot overcome environmental constraints. Replicators
in non-life are not the most fit in terms of stability
or replicative efficiency (e.g. homogeneous ‘1’ or ‘0’
sequences, respectively) but instead are predominantly
heterogeneous with compositions determined by the bulk
distribution of resources (right, top panel in Fig. 1).

In the life phase, the symmetry of the environment,
constituting an equal number of ‘0’ and ‘1’ monomers, is
broken in the composition of replicators due to selection
on the static fitness landscape, a feature which is not observed
for non-life (compare L = 7 compositions, Fig. 1).
In the life phase, replicators are primarily homogeneous
‘1’s or ‘0’s. The asymmetry imposed by selection on replicators
is exported to shorter sequences, which have the
opposite compositional signature to that of the replicators.
The compositional reversal is seen only below
L = 6. Although L = 6 sequences cannot replicate, they
are formed primarily via degradation of L = 7 replicators
and thus their formation is dominated by templated assembly.
In the life phase, we observe that replicators are
selected based on their intrinsic fitness and not strictly
their composition. The defining feature of the life phase
is therefore not necessarily the presence of replicators,
which exist in both phases. Instead, the defining characteristic
of “life” in this model is that the distribution
of resources is dictated by selection on the properties of
replicators.


A First Order Phase Transition from Non-life to Life 

For fixed values of k_p, k_d and k_r, the system exhibits
a spontaneous and abrupt phase transition from non-life
to life (no external tuning), as shown in Fig. 2. The time
of transition is exponentially distributed (not shown),
indicating that the transition is first order [23]. Often
there are many frustrated transitions prior to a successful
phase transition (top, Fig. 2). The frequency with which the
transition occurs depends on how well the composition
of the replicator(s) matches the environment (bottom,
Fig. 2). This result is distinct from other models
that do not account for environmental feedback:
here the transition is not coincident with the first ‘discovery’
of a sequence capable of replication, since replicators
exist in non-life. Instead, the transition occurs when the
dynamic fitness of replicators is high, as occurs when
replicators and environment synchronize
their composition. In our example, since both monomer
species are equally abundant in the initial distribution of
resources, the nucleation event is typically mediated by
heterogeneous replicator(s) composed of a roughly equal
number of ‘0’s and ‘1’s. These are not the sequences
that are ultimately selected in the life phase, which include
the homogeneous, fit sequences. Thus, in resource-limited
models like ours, the replicator(s) that nucleates
the transition will often not be that which is ultimately
selected.

FIG. 1. Ensemble averaged compositions of all sequences with L ≤ 7. The distributions in the top panel characterize the
non-life phase, and those in the bottom panel characterize the life phase. Data is averaged over 100 simulations.

FIG. 2. Top: Time series showing the mutual information
between extant replicator composition and the environment.
The phase transition is clearly evident in the abrupt shift from
I(R; E) ~ 2.5 in the non-life phase to I(R; E) ~ 0 in the life
phase at t ~ 5500. Also evident are several failed transitions,
including one that nearly runs to completion before reverting
back to the non-life phase near t ~ 3000. Bottom: The frequency
of successful phase transitions plotted against the difference
between the distribution of free monomers and replicator
composition, for an ensemble statistic of 256 simulations.

FIG. 3. Phase trajectory for an ensemble of 100 systems transitioning
from non-life to life, moving from left to right. Axes
are the mutual information I(R; E) between replicators and
environment (x axis) and the ratio of formation (polymerization
and replication) to degradation rates (y axis).
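
The link drawn above between exponentially distributed transition times and nucleation can be illustrated with a short worked step. The sketch assumes, purely for illustration, that a successful nucleation event occurs with a constant probability per unit time $\lambda$ while the system sits in the non-life phase; the symbols $\lambda$, $T$ and $S(t)$ are introduced here and do not appear in the model definition.

```latex
% T: waiting time until the transition; S(t) = \Pr(T > t) is the survival probability.
% A constant (memoryless) nucleation rate \lambda means
S(t + dt) = S(t)\,\bigl(1 - \lambda\, dt\bigr)
\quad\Longrightarrow\quad
\frac{dS}{dt} = -\lambda\, S(t),
\qquad S(0) = 1 .
% Solving gives an exponential distribution of transition times:
S(t) = e^{-\lambda t},
\qquad
p(t) = -\frac{dS}{dt} = \lambda\, e^{-\lambda t},
\qquad
\langle T \rangle = \frac{1}{\lambda} .
```

Under this assumption the mean waiting time is the inverse nucleation rate, so any change in the environment that alters how often a matching replicator appears would show up directly as a change in the average transition time.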

However, just as in models without resource restrictions,
selection ultimately leads to the fixation of fit sequences.
The transition is accurately tracked by measuring
the mutual information I(R; E) between the composition
of extant replicators and free monomer resources
(top panel, Fig. 2). Prior to the transition, replicators
and monomers share a high degree of mutual information
and the composition of replicators generally matches that
of their environment. However, once life emerges and
selection reconstitutes the allocation of resources, replicators
no longer match the information content of the
environment and I(R; E) → 0 with small fluctuations.
This behavior clearly illustrates how environmental (dynamic)
selection, with a high degree of mutual information
between environment and replicators, dominates in
the non-life phase, whereas functional (static) selection,
which is not dependent on the information content of the
environment, dominates in the life phase.

An interesting feature of this phase transition is that,
due to resource constraints, the selection of replicators
coincides with dynamic restructuring of the environment
(including both monomer and non-replicating (L < 7)
sequence populations, see e.g. Fig. 1). Fig. 3 shows
an ensemble averaged phase space trajectory through
this restructuring, which shows that the path
from the non-life to the life equilibrium is highly unstable
and dominated by degradation. In both the non-life
and life phases, polymer formation rates (polymerization
and replication) balance rates for polymer degradation,
with ratios of formation/degradation ~ 1. However, the
life and non-life phases are clearly distinguished in phase
space by very different values for the mutual information,
I(R; E): I(R; E) ~ 3.0 for non-life and I(R; E) ~ 0.25
for life, for results in Fig. 3 (see also time series with
different model parameters, top Fig. 2). The rampant
degradation through the phase transition results in a
rapid and dramatic restructuring of the extant polymer
population and a steep slope in the rate of sequence exploration,
as observed in Fig. 4. The extant diversity
and the rate of introduction of new sequences are both
higher in the life phase than the non-life phase (Fig. 4),
attributable to the higher turnover rate of resources in
the life phase (due to the higher assembly rate of polymers
via replication).

FIG. 4. Time series for the extant species population size and
total number of sequences explored by the system. Linear fits
to the explored species are shown. The exploration rate is 75%
faster during the life phase compared to the non-life phase,
and is 2 orders of magnitude larger during the transition.

Shown in Fig. 5 is the time evolution for all sequences
with L = 7, binned by sequence composition, for a set
of parameters where the transition is prolonged enough
to resolve details of the restructuring. Resource constraints
enforce selection of sequences in complementary
pairs that maintain the symmetry of the bulk resource
distribution of the environment (50% ‘0’s and 50% ‘1’s).
The system subsequently undergoes a series of abrupt
transitions associated with increasing sequence homogeneity,
where replicator composition increasingly breaks
the symmetries imposed by the environment. The asymmetry
introduced by replicators is exported to their environment,
as evidenced by the compositions of shorter
sequences L < 7 and the distribution of monomers (see
e.g. Fig. 1).

FIG. 5. Series of symmetry breaking transitions in the selection
of fit, homogeneous ‘0’ and ‘1’ length L = 7 replicators.
Here, the subscript denotes the number of ‘1’ monomers in the
sequence (e.g. x_0 contains no ‘1’s, x_1 bins all polymers with a
single ‘1’ monomer in their sequence, and x_7 contains all ‘1’s).
Parameters are k_p = 0.0005, k_d = 0.9000, and k_r = 1.000.


The Timescale for Life’s Emergence 

The phase transition from non-life to life described
here is a robust feature of the dynamics, observed over a
large range of parameter values with qualitatively similar
features. Quantitative differences arise in the final
abundances of replicators and in the timescale for the
transition to occur, which are both sensitive to the specific
details of the prebiotic chemistry under consideration.
Fig. 6 shows the average time to complete the phase
transition as a function of the degradation and replication
rate constants, k_d and k_r, which as noted earlier may
be viewed as specifying different environmental contexts
within which life might emerge. For the results presented,
the transition was identified as complete when 75% of the
total replicating mass was allocated in homogeneous (fit)
sequences.
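
The completion criterion can be written as a small check on a population snapshot. This is a sketch under stated assumptions: mass is counted in monomer units (sequence length times copy number), replicators are sequences of length at least 7, the threshold is treated as inclusive, and the names `fit_mass_fraction` and `transition_complete` are hypothetical.

```python
def fit_mass_fraction(pop, r_min=7):
    """Fraction of replicating mass held in homogeneous (all-'0' or all-'1') sequences.

    pop maps each sequence string to its copy number.
    """
    total = fit = 0
    for seq, n in pop.items():
        if len(seq) >= r_min:            # only replicators count towards the mass
            mass = len(seq) * n          # assumed: mass measured in monomer units
            total += mass
            if len(set(seq)) == 1:       # homogeneous sequence
                fit += mass
    return fit / total if total else 0.0

def transition_complete(pop, threshold=0.75):
    """True once the homogeneous fraction reaches the (assumed inclusive) threshold."""
    return fit_mass_fraction(pop) >= threshold

if __name__ == "__main__":
    snapshot = {"0000000": 3, "1111111": 3, "0101010": 2, "01": 50}
    print(fit_mass_fraction(snapshot), transition_complete(snapshot))
```

Short sequences such as the dimers in the example are ignored because they cannot replicate, so a large non-replicating background does not affect the classification.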

One might a priori expect the transition to be most
rapid (favored) for fast replication (high k_r) and slow
degradation (low k_d); however, this is not what is observed.
For high degradation rate k_d = 5.0, the time
to the transition is largely independent of k_r (top panel,
Fig. 6). Lowering the degradation rate (k_d = 1.0 and
k_d = 0.5, bottom two panels in Fig. 6) increases the dependence
of the transition time on k_r, and the transition, on average,
occurs most rapidly for relatively low k_r. This counterintuitive
behavior arises as a result of the resource constraints.
For high degradation rates, there is a high rate
of turnover, increasing the likelihood of discovering functionally
fit sequences, but the probability of survival is
low, so the transition time is long regardless of replicative
efficiency. For lower degradation rates, high replication
rates lock resources in less fit sequences, frustrating
the system’s restructuring, also leading to long transition
times.

The rate of degradative recycling appears to be the
primary factor in determining the transition timescale.
Fig. 7 shows the transition time observed for different
abiotic resource abundances, quantified by the ratio R
of the total number of ‘1’ monomers to total system mass.
The transition timescale is not expected to be symmetric
with respect to the relative abundance of ‘0’ and ‘1’
monomers. For large values of R (environments rich in
‘1’ monomers that confer stability), where recycling is inherently
slower, the average transition time may be much
longer than in environments with fewer stable polymers.
Our data supports this expectation although the variation
in transition times is large. These features suggest
that environments which engender degradative recycling
at a moderate rate may be the most conducive to nucleating
the origin of life under resource-limited conditions.


Hysteresis and Life as a Phase of Matter 


k d = 5.0 


io s 9 



* r = 0.05 kr = 0.1 fcr = 0.5 fcr = 1.0 kr = 5.0 


k d = 1.0 
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kr = 0.005 kr - 0.01 kr = 0.05 = 0.1 *v = 0.5 k r = 1.0 K = 5.0 


To further illustrate the distinction between the non¬ 
life and life phases we can study hysteresis effects. In the 
study of traditional physical phase transitions, partic¬ 
ularly in materials, hysteresis is a common phenomena 
[21] and occurs when characteristic quantities (in this 
case mass in homogeneous “fit” replicators), depend on 
the current state and history of the system [21] ■ Fig. 
[8] shows a hysteresis diagram for the transition between 
the non-life and life phases in our model. Here the sys¬ 
tem is identified as being in the fife phase if the homo¬ 
geneous replicators (be. functionally fit replicators) rep¬ 
resent more than 75 % of the total replicating mass. For 
the parameters presented in Fig. [8] the system can be 
in either the life phases or the non-life phase, however 
it is very sensitive to changes in the degradation rate. 
Holding other parameters fixed and increasing the degra¬ 
dation rate takes the system to a regime where the life 
state is unstable and will transition to the non-life state 
spontaneously. In green (top) the system was initialized 
at kd = 0.1 in the fife phase. The value of kd was then 
slowly increased (by 0.00001 per unit time), until the 
life phase became unstable and the system transitioned 
into non-life. In blue (bottom) is exactly the opposite, 
the system was initialized in non-life and the value of kd 
was decreased (again by 0.00001 per unit time) until the 
system transitioned into the life phase. For the parame¬ 
ters in this example, the non-life phase can persist even 
when there is a substantially large number of replicators 
in the system, be. as occurs when the majority of “repli¬ 
cators” are synthesized via polymerization, and the fit 
(homogeneous) replicators do not dominate the resource 
distribution. Thus the system can transition from life to
non-life, distinguished purely by whether or not there is
selection on the properties of replicators.

[Figure 6: panel kd = 0.5; kr = 0.005, 0.01, 0.05, 0.1, 0.5, 1.0, 5.0]

FIG. 6. Timescale for completing the phase transition as a
function of reaction rate constants for degradative recycling
kd and replication kr. Data from 25 simulations are shown;
all data points are included in the box and whisker plots.
The center line for each distribution is the median, the boxes
contain half the data points and the bars show the range.

FIG. 7. Timescale for completing the phase transition as a
function of the abiotic distribution of resources. Here, the
parameter R is the ratio of T’ monomers (which confer stability)
to total system mass. Data from over 100 simulations
are shown; all data points are included in the box and whisker
plots. The center line for each distribution is the median,
the boxes contain half the data points and the bars show the
range.

FIG. 8. Hysteresis diagram following the transition from life
to non-life and back to life. Rate constants are kp = 0.0005
and kr = 0.001, with the degradation rate kd slowly adjusted in
steps of 0.00001 per unit time to modulate the hysteresis
effect.
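The ramping protocol described above (initialize in one phase, change kd slowly, record where the phase label flips) can be sketched on a toy bistable system. The sketch is not the model of this paper: the one-variable rate law, the function name `sweep`, and the parameters `r`, `s`, `dt` and `ramp` are assumptions chosen only to produce two stable branches. Only the 75 % threshold and the idea of ramping kd in both directions come from the text.

```python
def sweep(x0, kd_start, kd_end, ramp=1e-4, r=1.0, s=1e-3, dt=0.1, threshold=0.75):
    """Ramp kd linearly in time and return the kd at which the phase label flips.

    x is a stand-in for the fraction of replicating mass held by fit replicators.
    Toy rate law (an assumption, not the paper's model):
        dx/dt = s + r * x**2 * (1 - x) - kd * x
    s: slow spontaneous production, r: resource-limited replication,
    kd: degradation. The system is labeled "life" when x > threshold.
    Returns None if the label never changes during the sweep.
    """
    x, kd = x0, kd_start
    direction = 1.0 if kd_end > kd_start else -1.0
    started_alive = x > threshold
    while (kd_end - kd) * direction > 0:
        # Forward-Euler step of the toy dynamics at the current kd.
        x += dt * (s + r * x * x * (1.0 - x) - kd * x)
        # kd changes by `ramp` per unit time, as in the protocol in the text.
        kd += direction * ramp * dt
        if (x > threshold) != started_alive:
            return kd
    return None


# Start in "life" and raise kd; then start in "non-life" and lower kd.
kd_life_to_nonlife = sweep(0.95, 0.05, 0.30)
kd_nonlife_to_life = sweep(0.003, 0.30, 0.01)
print(kd_life_to_nonlife, kd_nonlife_to_life)
```

In this toy system the two sweeps flip at different values of kd, with the life-to-non-life flip at the larger value. The interval between the two flip points is the region where either phase can persist depending on history, which is the signature of hysteresis.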


DISCUSSION 

We have demonstrated the existence of a spontaneous,
first-order phase transition from non-life to life that
exhibits many features not previously observed,
which arise due to explicit incorporation of environmental
feedback. It might be argued that the dynamics reported
here do not represent a true phase transition. In
the study of equilibrium physical systems, free energy is
the quantity which is minimized to determine the state
of the system [23]. Typically, this involves a trade-off
between minimizing total energy and maximizing entropy.
When these two favor different results, a system is
expected to exhibit a first-order phase transition from
order to disorder. Here, in our dynamical scenario, a
similar trade-off happens between two processes that
consume and try to minimize the number of free monomers
(which may be related to the minimization of free energy
[25]). These two different routes, viz. maximizing the
number of bonds via polymerization or maximizing
the number of polymers via replication, yield distinct
results with a sharp boundary between them, which
motivates the classification of the observed dynamics as a
phase transition.
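For reference, the equilibrium trade-off invoked above can be written compactly. This is the standard textbook form and not a result of the model; the symbols U_i, S_i and T_c are illustrative and do not appear in the original text, and treating U_i and S_i as temperature-independent near the transition is a simplifying assumption.

```latex
% Helmholtz free energy of phase i (i = 1 ordered, i = 2 disordered)
F_i(T) = U_i - T S_i , \qquad U_1 < U_2 , \quad S_1 < S_2 .

% The equilibrium state minimizes F, so the phases exchange stability
% where the two branches cross:
F_1(T_c) = F_2(T_c) \;\Longrightarrow\; T_c = \frac{U_2 - U_1}{S_2 - S_1} .

% First order: the first derivative of F is discontinuous at T_c
\Delta S = -\left.\frac{\partial (F_2 - F_1)}{\partial T}\right|_{T_c}
         = S_2 - S_1 \neq 0 .
```

In the dynamical analogy drawn in the text, polymerization and replication play the roles of the two competing branches; the correspondence is qualitative only.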

Due to the explicit coupling between replicators and 
environment, the phase transition reported here displays 
many features one might expect for a newly emergent 
biosphere that are not observable in open-flow reactor 
models. In particular, restructuring during the phase 
transition drives a vast increase in extant diversity and 
in the rate of exploration of novel diversity. This indi¬ 
cates that the emergence of life should coincide with an 
explosive growth of novelty in resource-limited systems.
Concomitantly, during the phase transition the system is
dramatically restructured, indicating that the emergence
of life should have significantly altered the environment 
of the early Earth. It is well known that biology alters its 
environment over generational and geological timescales, 
and that the presence of life defines many features of 
the Earth-system. It is interesting to observe this as a 
generic feature of life, characteristic of life as a phase of 
matter from its first inception, even in an abstract and 
simplistic model as presented here. In fact, we observe 
that if replicators do not transform their environment to 
enable their selection, the phase transition fails to com¬ 
plete. The model therefore includes the possibility of 
many frustrated trials before life first emerged (see e.g.
Fig. §), with success entailing a transformation of the
environment as a necessary component of the process of 
biogenesis. These features indicate that it should be diffi¬ 
cult to retrace the precise history of the origin of life: the 
replicators that are ultimately selected will, in general, 
neither be reflective of the ancient planetary environment 
nor be representative of the replicators that first nucle¬ 
ate the phase transition. Thus, in general the conditions 
favoring the emergence of life may not be the same as 
those favoring its subsequent evolution. 

Interestingly, the features characteristic of the phase 
transition are heavily dependent on degradative recy¬ 
cling of finite resources. This suggests new perspectives 
regarding the role of degradation in the origin of life, 
which is typically viewed as an impediment in prebiotic
chemistry, rather than a process central to early
evolution [26]. Cast in a new light by the resource-constrained
dynamics observed here, it is perhaps not
a coincidence that RNA, as a biopolymer that played a
prominent role in early evolution, is highly susceptible
to hydrolysis, perhaps resolving an apparent paradox in
the origin of life [27]. The properties of this phase transition
are in principle testable in the laboratory in experimental
systems that permit recycling of biopolymers, for
example as reported in [15]. In particular, the observed
dynamics should place further constraints on the kinds
of chemistries (defined by relative rates kd, kr and kp
and resource abundances) that are most conducive to
mediating the transition to living matter (see also e.g.
[28]).

Importantly, the most distinctive feature of the life 
phase in our model is not necessarily the presence of repli¬ 
cators, since these also exist in non-life. Instead it is se¬ 
lection on the properties of replicators, such as replicative 
efficiency and stability in the example presented here. 
Selection in turn necessitates a redistribution of matter 
due to resource constraints. This restructuring is coincident
with a sharp transition in the mutual information
between the composition of replicators and the distribu¬ 
tion of free monomers, which accurately tracks the phase 
transition. Previous work connecting information theory
to life’s origins reported that the probability of discovering
a self-replicator by chance should depend exponentially
on the availability of its composite monomers [29].
Our results demonstrate that an additional necessary feature,
in the case of resource-constrained replication, is that
replicators and environment synchronize their composition
(share high mutual information). Such synchronization
enables exponential growth of the replicator population
based on high dynamic fitness, which in turn enables
selection on the properties of new replicators discovered.
When these replicators do not match the bulk compo¬ 
sition of the system, they force restructuring to accom¬ 
modate their selection. We further note that very few 
measures have been proposed to explicitly quantify the 
origin of life transition. Here, mutual information be¬ 
tween replicators and environment accurately measures 
the progress of the phase transition reported, perhaps 
acting as an order parameter. Future work should iden¬ 
tify how broadly applicable this approach is, by applying 
mutual information to other candidate scenarios for the 
origin of life, such as in the formation of autocatalytic 
sets. 
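As a concrete reference for the quantity discussed above, the following is a minimal sketch of how mutual information between two discrete variables can be computed from a table of joint counts. It uses the generic textbook definition; the function name `mutual_information`, the argument `joint_counts`, and the choice of which two variables to tabulate (for example, monomer type against whether the monomer is bound in a replicator or free) are assumptions for illustration, and the exact estimator used in this work is not specified in this section.

```python
import math


def mutual_information(joint_counts):
    """Return I(X;Y) in bits from joint_counts[x][y], a table of non-negative counts.

    Uses I(X;Y) = sum_{x,y} p(x,y) * log2( p(x,y) / (p(x) * p(y)) ),
    with probabilities estimated as relative frequencies.
    """
    total = sum(sum(row) for row in joint_counts)
    if total <= 0:
        raise ValueError("joint_counts must contain at least one positive count")
    n_cols = len(joint_counts[0])
    # Marginal distributions over rows (X) and columns (Y).
    p_x = [sum(row) / total for row in joint_counts]
    p_y = [sum(row[j] for row in joint_counts) / total for j in range(n_cols)]
    mi = 0.0
    for i, row in enumerate(joint_counts):
        for j, count in enumerate(row):
            if count > 0:  # empty cells contribute nothing to the sum
                p_xy = count / total
                mi += p_xy * math.log2(p_xy / (p_x[i] * p_y[j]))
    return mi


# Two limiting cases for a 2 x 2 table (hypothetical counts).
independent = mutual_information([[1, 1], [1, 1]])
fully_coupled = mutual_information([[1, 0], [0, 1]])
print(independent, fully_coupled)
```

The two limiting cases bracket the behavior described in the text: statistically independent variables share 0 bits, while two perfectly coupled binary variables share 1 bit.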

We have identified replicators with “life” in this simple
model. However, the definition of life is an important
open philosophical and scientific question [30]. The observed
information-theoretic properties of this transition
are consistent with proposals that life is most defined by
its informational properties [31] (here, replicators might
be interpreted as driving the dynamics of the entire system
in a “top-down” manner due to adaptive selection
[32]). The life phase may be interpreted as a state where
the kinetics of individual replicators (e.g. as quantified
by their replicative efficiency and stability) dictate the 
behavior of the entire system, consistent with the notion 
that life is a kinetically driven state of matter iS]. Al¬ 
though our motivation is to understand the origin of life 
utilizing this model system, we note that the model is 
sufficiently abstract to capture features that may be uni¬ 
versal to a broader class of evolutionary transitions. In 
particular, the dynamics could be universally character¬ 
istic of the discovery of novel, selectable patterns in the 
distribution of resources among replicating populations. 
For example, the abrupt nature of the transition shares
features in common with punctuated equilibrium [33].
The dynamics of this phase transition also demonstrate
behavior that may be characteristic of mass extinctions:
the system’s restructuring necessitates a period of insta¬ 
bility driven by rampant destruction of extant diversity, 
which is followed by an explosion in novel diversity. The 
relationship to the phase transition reported here could 
be tested, for example, by analyzing the connection be¬ 
tween resource distribution patterns and abrupt evolu¬ 
tionary transitions. 

Finally, we point out that from the perspective of sta¬ 
bly propagating informational patterns (replicators) that 
are decoupled from those imposed by the bulk environ¬ 
ment, simple replicators such as those presented here 
may not be the most effective architecture for a self-reproducing
system. In this model, the total composition
of the system remains fixed; what life does is restructure
the distribution of matter within the system, due to the 
propagation of selectable replicating resource allocation 
patterns. An interesting open question is how this phase 
transition might play out for more life-like replicative systems,
such as those with the architecture of a von Neumann
self-reproducing automaton G21G2IE5], a subject
we leave to future work. 